Efficient Updates of Uncertain Databases

نویسندگان

  • Andreas Hubmer
  • Reinhard Pichler
  • Vadim Savenkov
  • Sebastian Skritek
چکیده

Uncertain databases have evolved as an active area of database research, surveyed, for example, in [10].The possible worlds semantics [1] is commonly used to deal with uncertain data. Several representation systems for uncertain databases have been proposed to provide efficient storage and query facilities for a potentially big number of possible worlds, see, e.g., [14,4,8]. Antova et al. introduced U-relations [2], which guarantee polynomial data complexity for queries of positive relational algebra (RA, for short). U-relations have been implemented and are available in the MayBMS system [6]. As in the case of certain databases, we clearly also want to update uncertain databases and pose queries of unrestricted RA. In [3], an API for uncertain databases, which also covers updates, is presented. The paper mentions that, for updates, decompression of the succinct representation of the possible worlds may be necessary. However, it leaves open the details and also the question if decompression could at least partly be avoided by some extension of the representation system. While the evaluation of positive RA queries on uncertain databases has been studied extensively, there has been little work beyond positive RA, like queries with having clauses [7] and queries with one level of anti-join (not-exists [13]). Fink et al. [5] describe an approach for unrestricted RA queries, but in a formalism that does not exhibit polynomial time data complexity for computing possible answers. The goal of this work is to start the investigation of the update problem of U-relations and to tackle the problem of evaluating queries with anti-joins over this formalism. To this end, we will first define the anti-join operator for U-relations and show how to use it to model updates. It will turn out that the decompression of the succinct representation of possible worlds may indeed be a performance problem. We will therefore introduce an extension of the Urelation representation system and show that our new formalism may lead to an exponential decrease in the representation of an updated uncertain database. Our main contributions will be as follows.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

I-25: Recurrent Pregnancy Loss; Updates in Etiologies, Diagnosis and Management

Background -Recurrent pregnancy loss defined as two or more miscarriage before 20 weeks of pregnancy affecting 1-5 % or women in reproductive age .There are many etiologies have been suggested, like Genetic, Immunologic, Thrombophila, Endocrine and Anatomic; but in 50% of cases, the exact etiology remains uncertain. Endometrium acts as biosensor of embryo quality and endometrium itself contribu...

متن کامل

UPI: A Primary Index for Uncertain Databases Citation

Uncertain data management has received growing attention from industry and academia. Many efforts have been made to optimize uncertain databases, including the development of special index data structures. However, none of these efforts have explored primary (clustered) indexes for uncertain databases, despite the fact that clustering has the potential to offer substantial speedups for non-sele...

متن کامل

UPI: A Primary Index for Uncertain Databases

Uncertain data management has received growing attention from industry and academia. Many efforts have been made to optimize uncertain databases, including the development of special index data structures. However, none of these efforts have explored primary (clustered) indexes for uncertain databases, despite the fact that clustering has the potential to offer substantial speedups for non-sele...

متن کامل

Efficient Query Processing Techniques in Uncertain Databases

Query processing on uncertain data has become increasingly important in many real-world applications. In this paper, we present our works on formulating and tackling three important queries in uncertain databases, that is, probabilistic group nearest neighbor (PGNN), probabilistic reverse skyline (PRSQ), and probabilistic reverse nearest neighbor (PRNN) queries.

متن کامل

Linear Logic for Taxonomical Networks and Database Updates

The aim of this paper is to propose a logical way to handle uncertain knowledge and change. Databases, diagnostic, planiication, taxonomy are some of the domains concerned by this problem. This paper focuses on the means Linear Logic ooers to represent taxonomical networks and to perform updates of databases containing incomplete information. The two problems are rst expressed in graph theory: ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013